PCT 



^WORLD INTELLECTUAL PROPERTY ORGANIZATION 

Internationa) Bureau 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
C12N 15/86, 7/00, 5/10 



Al 



(11) International Publication Number: WO 99/14354 

(43) International Publication Date: 25 March 1999 (25.03.99) 



(21) International Application Number: PCT/US98/ 19479 

(22) International Filing Date: 18 September 1998 (18.09.98) 



(30) Priority Data: 

60/059,330 



1 9 September 1 997 ( 1 9.09.97) US 



(71) Applicant (for all designated States except US): THE 

TRUSTEES OF THE UNIVERSITY OF THE PENN- 
SYLVANIA [US/US]; Suite 300, 3700 Market Street, 
Philadelphia, PA 19104-3147 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): WILSON, James, M. 
[US/US]; 1350 N. Avignon Drive, Gladwyne, PA 19035 
(US). XIAO, Weidong [CN/US]; Apartment P4, 155 Wash- 
ington Lane, Jenkintown, PA 19046 (US). 

(74) Agents: KODROFF, Cathy, A. et al.; Howson and Howson, 
Spring House Corporate Center, P.O. Box 457, Spring 
House, PA 19477 (US). - 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GM, HR, HU, ID, IL, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, 
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: METHODS AND VECTOR CONSTRUCTS USEFUL FOR PRODUCTION OF RECOMBINANT AAV 
(57) Abstract 



Methods for efficient production of recombinant AAV employ a host cell containing a first nucleic acid molecule comprising from 5' 
to 3\ a parvovirus P5 promoter, a spacer, an AAV rep sequence and an AAV cap gene sequence, wherein said spacer is of sufficient size 
to reduce expression of the repl% and rep6% gene products; a second nucleic acid molecule comprising a minigene comprising a transgene 
flanked by AAV inverse terminal repeats (ITRs) and under the control of regulatory sequences directing expression thereof in a host cell; 
and helper functions essential to the replication and packaging of rAAV, which functions are not provided by the first or second nucleic 
acid molecules. Host cells and molecule constructs are also described. 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/14354 PCTAJS98/ 19479 



METHODS AND VECTOR CONSTRUCTS USEFUL FOR PRODUCTION 

OF RECOMBINANT AAV 

This invention was made with financial assistance from the National 
Institutes of Health Grant No. NIAMS P01 AR/MS43648. The United States 
5 government has certain rights in this invention. 

Background pf the Inventipn 

Adeno-associated virus (AAV) is a replication-deficient parvovirus, the 
genome of which is about 4.6 kb in length, including 145 bp inverted terminal repeats 
(ITRs). Two open reading frames encode a series of rep and cap polypeptides. Rep 

10 polypeptides (rep78, re/?68, rep62 and repAO) are involved in replication, rescue and 
integration of the AAV genome. The cap proteins (VP1, VP2 and VP3) form the 
virion capsid. Flanking the rep and cap open reading frames at the 5' and 3' ends are 
the 145 bp ITRs, the first 125 bp of which are capable of forming Y- or T-shaped 
duplex structures. Of importance for the development of AAV vectors, the entire rep 

15 and cap domains can be excised and replaced with a therapeutic or reporter transgene 
[B. J. Carter, in "Handbook of Parvoviruses", ed., P. Tijsser, CRC Press, pp. 155-168 
(1990)]. It has been shown that the ITRs represent the minimal sequence required for 
replication, rescue, packaging, and integration of the AAV genome. 

When this nonpathogenic human virus infects a human cell, the viral 

20 genome integrates into chromosome 19 resulting in latent infection of the cell. 

Production of infectious virus and replication of the virus does not occur unless the 
cell is coinfected with a lytic helper virus, such as adenovirus or herpesvirus. Upon 
infection with a helper virus, the AAV provirus is rescued and amplified, and both 
AAV and helper virus are produced. The infecting parental ssDNA is expanded to 

25 duplex replicating form (RF) DNAs in a rep dependent manner. The rescued AAV 
genomes are packaged into preformed protein capsids (icosahedral symmetry 
approximately 20 nm in diameter) and released as infectious virions that have 
packaged either + or - ss DNA genomes following cell lysis. 
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AAV possesses unique features that make it attractive as a vector for 
delivering foreign DNA to cells. Various groups have studied the potential use of 
AAV in the treatment of disease states; however, progress towards establishing AAV 
as a transducing vector for gene therapy has been slow for a variety of reasons. One 
5 obstacle to the use of AAV for delivery of DNA is lack of highly efficient schemes for 
encapsidation of recombinant genomes and production of infectious virions [See, R. 
Kotin, Hum. Gene Ther. . 5:793-801 (1994)]. 

One proposed solution involves transfecting the recombinant adeno- 
associated virus (rAAV) containing the transgene into host cells followed by co- 

10 infection with wild-type AAV and adenovirus. However, this method leads to 

unacceptably high levels of wild-type AAV. Incubation of cells with rAAV in the 
absence of contaminating wild-type AAV or helper adenovirus is associated with little 
recombinant gene expression. In the absence of rep, integration is inefficient and not 
directed to chromosome 19. 

15 A widely recognized means for manufacturing transducing AAV 

virions entails co-transfection with two different, yet complementing plasmids. One of 
these contains the therapeutic or reporter transgene sandwiched between the two cis 
acting AAV ITRs. The AAV components that are needed for rescue and subsequent 
packaging of progeny recombinant genomes are provided in trans by a second plasmid 

20 encoding the viral open reading frames for rep and cap proteins. However, both rep 
and cap are toxic to the host cells. This toxicity has been the major source of 
difficulty in providing these genes in tram for the construction of a useful rAAV gene 
therapy vector. 

Other methods have been proposed to enable high titer production of 
25 rAAV. For example, US Patent No. 5,658,776 refers to packaging systems and 
processes for packaging AAV vectors that replace the AAV P5 promoter with a 
heterologous promoter. Alternatively, US Patent No. 5,622,856 refers to constructs 
and methods for AAV vector production, which provide constructs formed by moving 
the homologous P5 promoter to a position 3' to the rep genes, and optionally flanking 
30 the rep-cap and repositioned P5 promoter with FRT sequences. 
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There remains a need in the art for additional methods permitting the 
efficient production of AAV and recombinant AAV viruses for use in research and 
therapy. 

Summary of the Tnvention 
5 The present invention provides novel methods, host cells, and vector 

constructs which permit efficient production of rAAV, by decreasing the expression of 

the repWrep6S gene products, while leaving the expression of rep52, repAO and 

AAV structural proteins at a normal level. 

In one aspect, the invention provides a host cell containing 
10 (a) a first nucleic acid molecule comprising from 5' to 3', a 

parvovirus P5 promoter, a spacer, an AAV rep sequence and an AAV cap sequence, 

wherein the spacer is of sufficient size to reduce expression of the repl% and rep6Z 

gene products; 

(b) a second nucleic acid molecule comprising a minigene 
15 comprising a transgene flanked by AAV inverse terminal repeats (ITRs) and under the 

control of regulatory sequences directing expression thereof in a host cell; and 

(c) helper functions essential to the replication and 

packaging of rAAV. 

In another aspect, the invention provides a nucleic acid molecule useful 
20 in the production of recombinant AAV comprising from 5' to 3', a homologous P5 
promoter, a spacer, an AAV rep sequence and an AAV cap sequence, wherein the 
spacer is of sufficient size to reduce, but not eliminate, expression of the replS and 

rep6Z gene products. 

In yet a further aspect, the invention provides a method for increasing 
25 the production of recombinant adeno-associated virus (rAAV) by culturing a host cell 
as described above, by which the repWrep6% gene products are reduced in 
expression, and isolating from the cell lysate or cell culture, high levels of recombinant 
AAV capable of expressing said transgene. 
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Other aspects and advantages of the present invention are described 
further in the following detailed description of the preferred embodiments thereof 

Brief Description of the Drawings 
5 Fig. 1 A is a schematic illustration of a naturally occurring AAV nucleic 

acid sequence illustrating the P5 promoter 5' to the start site (ATG) of the rep and 

cap gene sequences. 

Fig. IB is a schematic illustration of a first nucleic acid sequence of the 

present invention showing spacer 'X' inserted between the P5 promoter and the start 
10 site of rep and cap gene sequences. 

Fig. 2 A is a schematic of plasmid pFG140, a commercially available 

(Microbix Biosystems, Inc.) plasmid containing a substantial portion of the adenovirus 

type 5 genome except for the Ela and Elb genes. This plasmid may be used to 

provide helper functions in the method of the invention. 
15 Fig. 2B is a schematic of a smaller plasmid pFa13, obtained by 

digesting pFG140 with RsrII, removing the smaller RsrII fragment and religating the 

plasmid. This plasmid may also be used to provide helper functions in the method of 

the invention. 

Detailed Description of the Invention 

20 The invention provides methods and compositions for efficiently 

producing high titers of r AAV. The method of this invention may be employed to 
produce rAAV carrying therapeutic transgenes, which are particularly useful in 
transferring the transgene to a host cell or tissue. These rAAV are also useful as 
research reagents, or as tools for the recombinant production of a transgene product 

25 in vitro. 



I. Compositions 

In one embodiment, the invention provides a host cell which contains 

the following components: 
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(a) a first nucleic acid molecule comprising from 5' to 3', a 
parvovirus P5 promoter, a spacer, an AAV rep gene sequence and an AAV cap gene 
sequence, wherein the spacer is of sufficient size to reduce expression of the repl% 
and rep6% gene products relative to other rep gene products; 
5 (b) a second nucleic acid molecule comprising a minigene 

comprising a transgene flanked by AAV inverted terminal repeats (ITRs) and under 
the control of regulatory sequences directing expression thereof in a host cell; and 
(c) helper functions essential to the replication and packaging of 

rAAV. 

10 A. The First Nucleic A cid Molecule 

The key components of the first molecule are arranged in 5' to 
3* order: the parvovirus P5 promoter, a spacer interposed between the promoter and 
the start site of the rep gene sequence, and the cap gene sequence. 

The parvovirus P5 promoter used in the first nucleic acid 

15 molecule is preferably homologous to the AAV serotype which provides the rep gene 
sequences and cap gene sequences. Alternatively, the promoter may be a P5 
promoter from another AAV type than that which provides the rep and cap 
sequences. The AAV P5 promoter sequences, as well as the ITR sequences employed 
in the second nucleic acid molecule described below, may be obtained from any 

20 known AAV, including presently identified human AAV types. Similarly, AAVs 

known to infect other animals may also provide the P5 promoter, rep and cap gene 
sequences, and ITRs employed in the constructs of this invention. The selection of 
the AAV to provide any of these sequences is not anticipated to limit the following 
invention. For example, the P5 promoter may be provided by AAV type 1, AAV type 

25 2, AAV type 3, AAV type 4, AAV type 5, parvovirus type HI, MVM, LuIII, or from 
any other parvovirus or AAV serotype. A variety of AAV strains are available from 
the American Type Culture Collection or are available by request from a variety of 
commercial and institutional sources. In the following exemplary embodiments an 
AAV-2 is used for convenience. 
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The spacer is a DNA sequence interposed between the 
promoter and the rep gene ATG (start) site. The spacer may have any desired 
design; that is, it may be a random sequence of nucleotides, or alternatively, may 
encode a gene product, such as a marker gene. The spacer may contain genes which 

5 typically incorporate start/stop and polyA sites. The spacer may be a non- coding 
DNA sequence from a prokaryote or eukaryote, a repetitive non-coding sequence, a 
coding sequence without transcriptional controls or coding sequences with 
transcriptional controls. As illustrated below, two exemplary sources of spacer 
sequence are the X phage ladder sequences or yeast ladder sequences, which are 

10 available commercially, e.g., from Gibco or Boehringer Mannheim, among others. 

The spacer may be of any size sufficient to reduce expression of 
the repl% and rep6S gene products, leaving the repS2 y rep40 and cap gene products 
to be expressed at normal levels. The length of the spacer may therefore range from 
about 10 bp to about 10.0 kbp. As illustrated below spacers of 100 bp to about 8.0 

15 kbp in length were used effectively. In one experimental design, maximum expression 
levels of replS and rep6S were achieved with a spacer of about 500 bp in length. 
Desirably, to reduce the possibility of recombination, the spacer is preferably less than 
2 kbp in length. However, the invention is not so limited. 

The rep gene sequences and cap gene sequences are obtained from the same or a 
20 different serotype of AAV from that which supplies the P5 promoter. These 

sequences may be contiguous, or may be non-contiguous sequences, as desired, and 
may be derived from a single AAV or from different AAV sources. The AAV rep and 
cap sequences, as well as the P5 promoter may be obtained by conventional means 
(see Example 1 below). In all cases, in the first nucleic acid molecule, the P5 
25 promoter and spacer are 5' to the rep cap sequences. 

The first nucleic acid molecule may be in any form which 
transfers these components to the host cell. As one example, the first nucleic acid 
molecule is preferably in the form of a plasmid, which may contain other non-viral or 
viral sequences. However, this molecule does not contain the AAV ITRs and 
30 generally does not contain the AAV packaging sequences. As one example, the first 
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nucleic acid molecule described in Example 1 below contains a plasmid sequence from 
the commercially available Bluescript plasmid. A series of such plasmids are identified 
by the designation pJWX-Y, with Y indicating a different size of spacer. As another 
example, a plasmid may contain the key components described above, and further 
5 contain adenovirus sequences, such as map units 0-1 and 9-16 thereof as well as 
plasmid sequence. This plasmid is desirably constructed so that it may be stably 
transfected into a cell. 

Alternatively, the first nucleic acid molecule may be in the form 
of a recombinant virus, such as an adenovirus or baculovirus. For example, the key 

10 components may be inserted as a "minigene" into the El region of an El -deleted 

adenovirus vector. See, e.g., published PCT patent application Nos. W096/13598; 
W096/13597 and US Patent No. 5,652,224, among others. 

The first nucleic acid molecule may also exist in the host cell as 
an episome. Still alternatively, the molecule, or at least the key components described 

15 in detail below, may be integrated into the chromosome of the host cell. 

The methods employed for constructing a molecule of this invention are conventional 
genetic engineering or recombinant engineering techniques such as those described in 
conventional texts. See, e.g., Sambrook et al, Molecular Cloning. A Laboratory 
Manual. 2d edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 

20 (1989). While Example 1 provides a specific illustration of the first nucleic acid 

molecule of this invention, using the information provided herein, one of skill in the 
art may select and design other suitable first nucleic acid molecules, with the choice of 
spacers, P5 promoters and the like, taking into consideration such factors as length, 
the presence of at least one set of translational start and stop signals, and optionally, 

25 the presence of polyadenylation sites. 

B. The Second Nucleic A cid Molecule 

The second nucleic acid molecule provides in cis a minigene, 
which is defined sequences which comprise a selected desired transgene, a promoter, 
and other regulatory elements necessary for expression of the transgene in a host cell, 

30 flanked by AAV inverse terminal repeats (ITRs). 
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The AAV sequences employed are preferably the cis-acting 5' 
and 3* inverted terminal repeat (ITR) sequences [See, e.g., B. J. Carter, in "Handbook 
of Parvoviruses", ed., P. Tijsser, CRC Press, pp. 155-168 (1990)]. The ITR sequences 
are about 145 bp in length. Preferably, substantially the entire sequences encoding the 

5 ITRs are used in the vectors, although some degree of minor modification of these 
sequences is permissible for this use. The ability to modify these ITR sequences is 
within the skill of the art. [See, e.g., texts such as Sambrook et al, "Molecular 
Cloning. A Laboratory Manual.", 2d edit., Cold Spring Harbor Laboratory, New 
York (1989); Carter et al, cited above; and K. Fisher et al, J. Virol., 70:520-532 

10 (1996). As described above, the AAV source of such sequences is not a limitation 
upon this invention. 

In one embodiment, the 5' and 3' AAV ITR sequences flank the 
selected transgene sequence and associated regulatory elements (i.e., the 5 5 AAV ITR 
is 5' of the transgene and the regulatory elements and the 3 5 AAV ITR is 3' of the 

15 transgene and regulatory elements). The transgene sequence of the second molecule 
is a nucleic acid sequence, heterologous to the AAV sequence, which encodes a 
polypeptide or protein of interest. The composition of the transgene sequence 
depends upon the use to which the resulting second molecule is to be put. For 
example, one type of transgene sequence includes a reporter sequence, which upon 

20 expression produces a detectable signal. Such reporter sequences include without 
limitation an E. coli beta-galactosidase (LacZ) cDN A, an alkaline phosphatase gene 
and a green fluorescent protein gene. These sequences, when associated with 
regulatory elements which drive their expression, provide signals detectable by 
conventional means, e.g., ultraviolet wavelength absorbance, visible color change, etc. 

25 For example, where the transgene is the LacZ gene, the presence of rAAV is detected 
by assays for beta-galactosidase activity. 

However, desirably, the second molecule carries a non-marker 
gene which can be delivered to an animal via the rAAV produced by this method. A 
preferred type of transgene sequence is a therapeutic gene which expresses a desired 

30 gene product in a host cell. These therapeutic nucleic acid sequences typically encode 



WO 99/14354 a a PCT/US98/19479 



9 

products which, upon expression, are able to correct or complement an inherited or 
non-inherited genetic defect or treat an epigenetic disorder or disease. However, the 
selected transgene may encode any product desirable for study. The selection of the 
transgene sequence is not a limitation of this invention. 
5 In addition to the major elements identified above, the minigene 

of the second molecule also includes conventional regulatory elements necessary to 
drive expression of the transgene in a cell transfected with this vector. Thus, the 
minigene comprises a selected promoter which is operatively linked to the transgene 
and located, with the transgene, between the AAV ITR sequences. Selection of the 

10 promoter used to drive expression of the transgene is a routine matter and is not a 
limitation of the vector. 

In a preferred embodiment, the transgene is under the control 
of a cytomegalovirus (CMV) immediate early promoter/enhancer [see, e.g., Boshart et 
al, Cell , 41:521-530 (1985)]. However, other suitable promoters may be readily 

15 selected by one of skill in the art. Useful promoters may be constitutive promoters or 
regulated (inducible) promoters, which will enable control of expression of the 
transgene product. Another suitable promoter is the Rous sarcoma virus LTR 
promoter/enhancer. Still other promoter/enhancer sequences may be selected by one 
of skill in the art. 

20 The minigene also desirably contains heterologous nucleic acid 

sequences including sequences providing signals required for efficient polyadenylation 
of the transcript and introns with functional splice donor and acceptor sites. A 
common poly-A sequence which is employed in the exemplary vectors of this 
invention is that derived from the papovavirus SV-40. The poly-A sequence generally 

25 is. inserted following the transgene sequences and before the 3' AAV ITR sequence. A 
common intron sequence is also derived from SV-40, and is referred to as the SV-40 
T intron sequence. A minigene of the present invention may also contain such an 
intron, desirably located between the promoter/enhancer sequence and the transgene. 
Selection of these and other common vector elements are conventional and many such 

30 sequences are available [see, e.g., Sambrook et al, and references cited therein]. 
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The second nucleic acid molecule carrying the AAV ITRs 
flanking the minigene may be in any form which transfers these components to the 
host cell. As described above for the first nucleic acid molecule, the second molecule 
may contain a plasmid backbone. For example, the second nucleic acid molecule of 
5 Example 2 is in the form of a plasmid containing other viral or non-viral sequences. 
The plasmid may further contain adenovirus sequences, such as map units 0-1 and 9- 
16. 

Alternatively, the second nucleic acid molecule may be in the 
form of a recombinant virus which is used to infect the host cell. The second 

10 molecule may be a recombinant replication-defective adenovirus containing the 
transgene operatively linked to expression control sequences in the region of an 
adenovirus El deletion. Suitable Ad/ AAV recombinant viruses may be produced in 
accordance with known techniques. See, e.g., International patent applications 
W096/13598, published May 9, 1996; WO 95/23867 published Sept. 8, 1995, and 

15 WO 95/06743 published March 9, 1995, which are incorporated by reference herein. 

As either a plasmid or a virus, the second nucleic acid molecule 
may exist in the host cell as an episome or may be integrated into the chromosome of 
the host cell. 

The methods useful for constructing a second nucleic acid 
20 molecule of this invention are well-known to those of skill in the art and include 
genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., 
Sambrook et al, cited above; and the international patent publications cited above. 
C Helper functions 

Helper functions essential to the replication and packaging of 
25 rAAV are also provided by or to the host cell in a variety of ways. For example, 

essential helper functions may be provided by the molecules (a) and (b) which contain, 
for example, adenovirus gene sequences, as described above. As another example, at 
least one of the molecules (a) or (b) may be a recombinant virus, which also supplies 
some or all helper functions to the cell. 
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Alternatively, helper functions may be provided by the host cell 
by virtue of sequences integrated into the chromosome of the cell. For example, the 
host cell may be an adenovirus or herpesvirus packaging cell, i.e., it expresses 
adenovirus or herpesvirus proteins useful for the production of AAV, such as HEK 
5 293 cells and other packaging cells. In the case where the helper functions are 

expressed by the selected host ceil, or by the host cell transfected with (a) or (b), no 
additional molecules are required. 

However, where a packaging cell line is not used as the host 
cell, or the helper functions are not sufficiently present, still another source of helper 
10 functions is a third nucleic acid molecule. In one embodiment this third nucleic acid 
molecule is a plasmid which contains helper functions. See, for example, the "helper" 
plasmids of Fig 2A and Fig. 2B, which contain adenovirus sequences in a plasmid 
backbone. 

In another embodiment, the third molecule is a recombinant or 

15 wild-type helper virus, such as an adenovirus, baculovirus, retrovirus or herpesvirus, 
which provides the helper functions. Whether the optional third molecule is a plasmid 
or virus, it may exist in the cell as an episome. Where the helper functions are 
available on a separate molecule, the "host cell" may be any mammalian cell and not 
necessarily a packaging cell, such as HEK 293. Examples of suitable parental host 

20 cell lines include, without limitation, HeLa, A549, KB, Detroit, and WI-38 cells. 

These cell lines are all available from the American Type Culture Collection, 10801 
University Boulevard, Manassa, Virginia 201 10-2209. Other suitable parent cell lines 
may be obtained from other sources. 

Examples 1-3 below illustrate useful molecules and host cells of 

25 this invention. Using the information provided herein and known techniques, one of 
skill in the art could readily construct a different recombinant virus (i.e., non- 
adenovirus) or a plasmid molecule which is capable of driving expression of the 
selected component in the host cell. For example, although less preferred because of 
their inability to infect non-dividing cells, vectors carrying the required elements of the 

30 first or second nucleic acid molecules, e.g., the P5-spacer-re/?-cop or the ITR- 
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transgene-ITR sequences, may be readily constructed using e.g., retroviruses or 
baculoviruses. Therefore, this invention is not limited by the virus or plasmid selected 
for purposes of introducing the essential elements of the first nucleic acid sequence or 
second nucleic acid sequence or the optional third nucleic acid sequence into the host 
5 cell. 

IL Methods of the Invention 

In another embodiment, the present invention provides a method for 
increasing the production of rAAV by decreasing the expression of the rep7& and 
rep6S gene products, keeping the expression of rep52 and rep40, and the cap gene 

10 products at normal levels. This method includes the steps of culturing a host cell 
described above, which contains nucleic acid molecules (a) and (b), and helper 
functions (c), as described above; and isolating from the cell lysate or cell culture, a 
recombinant AAV capable of expressing the transgene of molecule (b). 

In one embodiment of the method, a selected host cell is co-transfected 

15 with the first and second nucleic acid molecules, as described above, and then infected 
with a wild-type (wt) or replication defective virus, or transfected helper plasmid, to 
supply the helper functions. Suitable helper viruses may be readily selected by those 
of skill in the art and include, for example, wt Ad2, wt Ad5, and herpesviruses, as well 
as the replication defective adenovirus dl309. Suitable helper plasmids may also be 

20 readily selected by those of skill in the art and include, for example, the pFG140, 

pFA13, and pBHGlO, which are described herein. In another embodiment, the host 
cell is an adenovirus packaging cell, such as HEK 293, and the first or second nucleic 
acid molecule is a recombinant virus, which also contains the remaining adenovirus 
helper functions necessary to package AAV in the presence of the essential elements 

25 provided by (b) and (c). Selection of the means by which the helper functions are 
provided is not a limitation on the present invention. 

Suitable techniques for introducing the molecules of this invention into 
the host cell are known to those of skill in the art. When all molecules or vectors are 
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present in a cell and the cell is provided with helper functions, the rAAV is efficiently 
produced. 

In another embodiment of the method of this invention, a packaging 
cell line is constructed which stably expresses the helper functions (c), or which 
5 expresses the first nucleic molecule (a). According to this aspect of the method, the 
cell line expressing the (c) or (a) elements can be substituted for the vector or plasmid 
(a) or (c) as described above. Thus, only the second molecule (i.e., the cis plasmid) 
described above is subsequently introduced into the cell. 

Having obtained such a helper-expressing cell line, this cell line can be 
10 infected (or transfected) with the first vector (a) containing the rep and cap genes and 
the second vector (b) containing the minigene described above. 

The methods of this invention demonstrate that the limiting step for the 
high yield of rAAV is not the replication of cis plasmid; but the packaging process and 
replS and rep6& can interfere with the packaging process directly or indirectly. 

15 m. Production of Vectors and rAAV 

Assembly of the selected DNA sequences contained within each of the 
molecules described above may be accomplished utilizing conventional techniques. 
Such techniques include cDNA cloning such as those described in texts [Sambrook et 
al, cited above], use of overlapping oligonucleotide sequences of the adenovirus, 

20 AAV genome combined with polymerase chain reaction, synthetic methods, and any 
other suitable methods which provide the desired nucleotide sequence. 

Introduction of the molecules (as plasmids or viruses) into the host cell 
may also be accomplished using techniques known to the skilled artisan. Where 
appropriate, standard transfection and co-transfection techniques may be employed, 

25 e.g., CaP0 4 transfection or electoporation using the human embryonic kidney (HEK) 
293 cell line (a human kidney cell line containing a functional adenovirus Ela gene 
which provides a transacting Ela protein). Other conventional methods employed in 
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this invention include homologous recombination of the viral genomes, plaquing of 
viruses in agar overlay, methods of measuring signal generation, and the like. 

Following infection/transfection, the host cell is then cultured to enable 
production of the rAAV [See, e.g., F. L. Graham and L. Prevec, Methods Mol. BioL 
5 7:109-128 (1991), incorporated by reference herein]. Desirably, once the rAAV is 
identified, it may be recovered and purified using standard techniques. 

The following examples illustrate the preferred methods of the 
invention. These examples are illustrative only and are not intended to limit the scope 
of the invention. 

10 Example 1 - Construction of First Nucleic Acid Molecules 
A. Tram Plasmids 

An exemplary first molecule of the present invention is 
provided as a plasmid containing the P5--spacer--AAV rep and cap genes as follows. 
See Figs. 1A and IB. 

15 The AAV P5 promoter was amplified from the 121 bp Xbal- 

BamHI fragment from plasmid psnb20J, which contains the entire AAV2 genome 
[RJ. Samulski et al, J Virol. . 61:3096-3101 (1987)] by PCR using two 
oligonucleotides: 

oligo 1 : TGT AGT TAA TGA TTA ACC CGC CAT GCT 
20 ACT TAT C [SEQ ID NO: 2] and oligo 2: GGC GGC TGC GCG TTC AAA CCT 
CCC GCT TCA AAA TG [SEQ ID NO: 3]. This P5 promoter sequence was 
subsequently cloned into plasmid pCR2.1 (Invitrogen), resulting in a new plasmid, 
pCR-P5. The AAV rep and cap coding region is amplified from the AAV type 2 virus 
by primers TATTTAAGCCCGAGTGAGCT [SEQ ID NO: 4] and 
25 TAGCATGGCGGGTTAATCATTAACTACA [SEQ ID NO: 5] and cloned into the 
unique Smal site of pBluescript (Promega). The resulting plasmid is called pBS- 
AAV. 

The P5 promoter is then excised from pCR-P5 by digestion 
with BamHI and Xhol, filled in by Klenow and then cloned into the Clal site of 
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plasmid pBS-AAV. The resulting plasmid, designated as P5-X, contains a unique 
EcoRV site between the P5 promoter and the initiation codon of rep7&. 

The helper plasmid (i.e., first nucleic acid molecule) is made by 
cloning the desired spacer, in this case, either the X phage or yeast 100 bp ladder and 
5 500 bp ladder sequences (Gibco; BRL) into the EcoRV site in P5-X. The resulting 
series of plasmids are designated as pJWX-Y (Fig. 1 A). Reference to Table I codifies 
these plasmids as pJWX-Y, in which Y indicates the size of the plasmid. The spacer 
sizes present in these plasmids range from 100 bp to 8 kb. Fig. 1 A represents the 
normal relationship of P5 to the rep and cap genes. Fig. IB represents the P5-spacer- 
10 rep-cap configuration of the first nucleic molecule. 

Example 2 - Construction of Second Nucleic Acid Molecule 

A "cis" plasmid, or second nucleic acid molecule useful in the present 
invention contains a minigene comprising AAV ITRs flanking a promoter and 
transgene, the minigene inserted into a plasmid backbone. In the present example, the 

15 exemplary cis plasmid is AV.CMVLacZ [SEQ ID NO: 1 ; see International Patent 
Application NO. W095/13598] was utilized as the cis plasmid (the second nucleic 
acid molecule) useful in the methods of this invention. It is a rAAV cassette in which 
AAV rep and cap genes are replaced with a minigene expressing B-galactosidase from 
a CMV promoter. The linear arrangement of pAV.CMVZacZ includes: 

20 (a) the 5' AAV ITR (bp 1-173) obtained by PCR using pAV2 [C. A. 

Laughlin et al, Gene . 23: 65-73 (1983)] as template [nucleotide numbers 365-538 of 
SEQ ID NO: 1]; 

(b) a CMV immediate early enhancer/promoter [Boshart et al, 
Cell . 41:521-530 (1985); nucleotide numbers 563-1157 of SEQ ID NO:lj, 
25 (c) an S V40 intron (nucleotide numbers 1 1 78- 1 1 79 of SEQ ID 

NO:l), 

(d) E. coli beta-galactosidase cDNA (nucleotide numbers 1356 - 
4827 of SEQ ID NO:l), 
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(e) an SV40 polyadenylation signal (a 237 BamHI-BclI restriction 
fragment containing the cleavage/poly-A signals from both the early and late 
transcription units; nucleotide numbers 4839 - 5037 of SEQ ID NO:l) and 

(f) 3'AAV ITR, obtained from pAV2 as a SnaBI-Bglll fragment 

5 (nucleotide numbers 5053 - 5221 of SEQ ID NO: 1). The remainder of the plasmid is 
simply plasmid backbone from a pBR322-derivative. 

Example 3 - Production of rAAV 

According to one embodiment of the present invention, 5x1 0 6 HEK 

10 293 cells (American Type Culture Collection, Rockville, Maryland) were transfected 
as follows: 2 \xg of the helper plasmid pFaB (Fig. 2B), 1 \xg cis plasmid 
(pAV.CMVIacZ [SEQ ID NO: 1] of Example 2) and 1 fig of a trans plasmid selected 
from the groups listed in Table I, were transfected into 293 cells using DOTAP 
(Boehringer Mannheim Biotech). 

15 Forty-eight hours later, each group of cells were harvested [J. Price et 

al, Proc. Natl. Acad. ScL USA . 84:156-160 (1987)]. The cell lysate was then 
subjected to three rounds of freeze-thaw cycles. The amount of rAAV virus in 
supernatant was then titer by x-gal assay. To get pure rAAV virus, the cell lysate can 
be purified by CsCl gradient. 

20 Table I lists the identity of the first nucleic acid molecule (i.e., the trans 

plasmid), the size of the spacer therein, and the total yield of rAAV from 2X1 0 7 cells 
in two production experiments was reported as LacZ-forming units (LFU). In Table I 
below 1 unit represent 1 xlO 5 LFU. In this case, p Ad AAV represents a helper 
plasmid containing no spacer between P5 and repcap (see Fig. 1 A). 
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Table I 



Total Yield 



p i rans piasmta 


spacer 


i at 






pAdAAV 


none 


0.6 


2.6 


1.6 


pJWX-100 


100 bp 


1 


8.7 


4.9 


pJWX-200 


2x100 bp repeats 


22 


33.4 


28 


pJWX-300 


3x100 bp repeats 


25 


70 


48 


pJWX-400 


4x100 bp repeats 


29 


60 


45 


pJWX-500 


5x100 bp repeats 


24 


55 


40 


pJWX-600 


6x100 bp repeats 


31 


130 


81 


pJWX-700 


7x100 bp repeats 


26 


31.4 


29 


pJWX-800 


8x100 bp repeats 


29 


28.3 


29 


pJWX-900 


9x100 bp repeats 


N/A 


24 


24 


pJWX-1 k 


10x100 bp repeats 


N/A 


33 


33 


pJWX-1.1 k 


1 1x100 bp repeats 


15 


29 


22 


pJWX-0.5 k 


500 bp insert 


30 


72 


51 


pJWX-1 k 


1 kb insert 


25 


40 


33 


pJWX-1.5k 


1.5 kb insert 


16 


19 


18 


pJWX-2 k 


2kb insert 


20 


21 


21 


pJWX-2.5 k 


2.5kb insert 


19 


32 


26 


pJWX-3 k 


3.02 kb insert 


20 


17 


19 


pJWX-3.5 k 


3.5 kb insert 


21 


18.4 


20 


pJWX-4 k 


4.01 kb insert 


26 


12.7 


19 


pJWX-4.5 k 


4.5kb insert 


16 


25 


21 


pJWX-5 k 


5.01 kb insert 


25 


13.2 


19 


pJWX-8k 


8 kb insert 


27 


17.4 


22 



Surprisingly the rAAV yield is greatly improved using these helper 

plasmids. 



Example 4 - Further Characterization of the rAAV Produced bv the Method 

30 A. Western blots 

A Western blot was performed on transfections using a variety 
of different trans plasmids having different spacer sequences (Example 3). The results 
showed that the expression of replZ and rep6S from these rAAV was greatly reduced 
while the expression of rep52, repAO and AAV structural proteins remained 

35 unchanged. The results showed that replication of the gene containing cis plasmid 
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was not significantly affected even though the amount of replS and rep6S was 
decreased. 

Therefore, the optimization of rep7& and rep6& expression is 
critical for high titer rAAV production. 
5 B. Southern Blot 

A Southern blot was performed on transfections using the 
method of this invention using as "trans" plasmids or first nucleic acid molecules, 
either mock (no trans plasmid), AdAAV (a plasmid containing no spacer between the 
P5 and rep ATG site), or trans plasmids of the present invention containing varying 
10 sizes of spacer. 

The position of the dimer and monomer bands did not change 
across all trans plasmids used. This demonstrates that the spacers between the P5 
promoter and rep and cap genes of the trans plasmids of the invention do not affect 
the replication of the cis plasmid in the method. The AAV rep7$/6S expression is 
15 sufficient in the method of this invention to maintain normal AAV replication. 

Example 5 - Titer Comparisons 

The following two tables of data represent studies in which the 
methods of the present invention were performed by, (a) transfecting 293 cells by 
calcium phosphate precipitation with the trans plasmids identified in each table, the cis 

20 plasmid of Example 2 and the pFa13 helper plasmid of Fig. 2A or (b) transfecting 

5xl0 6 HEK 293 cells (American Type Culture Collection, Manassas, Virginia) with 1 
jig cis plasmid (pAV.CMVLarcZ [SEQ ID NO: 1] of Example 2) and 1 jxg of a trans 
plasmid selected from the groups listed in Table III using Lipofectin (Gibco BRL). 
These cells were thereafter infected with wildtype adenovirus type 5 at an MOI of 5 to 

25 supply the helper functions. 

Table II illustrates the virus titers of three trials using protocol (a). 
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TABLE II 



Trans Plasmid 


Trial 1 


Trial 2 


Trial 3 


Mock (no trans) 


0 


0 


0 


AdAAV (no spacer) 


100 


100 


100 


pJWX-23 


693 


391 


838 


pJWX-4k 


344 


330 


444 


pJWX-lk 


441 


321 


475 


pJWX-500 


344 


278 


437 



Table III illustrates the titers of one trial using protocol (b). In this 
10 case a titer unit of 1 is equivalent to 3xl0 6 infectious particles. 

TABLE III 





Trans plasmid 


Titer 




Mock 


0.0 


15 


AdAAV 


0.9 




pJWX-100 


3.4 




pJWX-200 


2.4 




pJWX-300 


5.4 




pJWX-400 


3.9 


20 


pJWX-500 


7.2 




pJWX-600 


3.6 




pJWX-700 


2.3 




pJWX-800 


3.0 




pJWX-900 


3.6 


25 


pJWX-lk 


2.5 




pJWX-1.5k 


1.0 




pJWX-2k 


2.0 



Publications cited in this specification are incorporated herein by 
reference. Numerous modifications and variations of the present invention are 
30 included in the above-identified specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to the processes of the present 
invention are believed to be encompassed in the scope of the claims appended hereto. 
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WHAT IS CLAIMED IS: 

1 . A recombinant host cell containing 

(a) a first nucleic acid molecule comprising, from 5' to 3', a 
parvovirus P5 promoter, a spacer, an AAV rep sequence and an AAV cap gene 
sequence, wherein said spacer is of sufficient size to reduce expression of the replS 
and rep6S gene products relative to other rep gene products, and 

(b) a second nucleic acid molecule comprising a minigene 
comprising a transgene flanked by AAV inverse terminal repeats (ITRs) and under the 
control of regulatory sequences directing expression thereof in a host cell; and 

(c) helper functions essential to the replication and 

packaging of rAAV. 

2. The cell according to claim 1 wherein said spacer is between 
about 10 bp to 10 kb in length. 

3. The cell according to claim 1 wherein the spacer sequence is 
between 100 bp to 3.8 kb in length. 

4. The cell according to claim 1 wherein said spacer sequence is 
about 500 bp in length. 

5. The cell according to claim 1 wherein the spacer sequence is a 
random sequence of nucleotides. 

6. The cell according to claim 1 wherein said spacer sequence 
encodes a gene product. 

7. The cell according to claim 1 wherein said first nucleic acid 
molecule is a plasmid. 
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8. The cell according to claim 1 wherein said first nucleic acid 
molecule is a recombinant virus. 

9. The cell according to claim 1 wherein said first nucleic acid 
molecule is present in said cell as an episome. 

10. The cell according to claim 1 wherein said first nucleic acid 
molecule is integrated into the chromosome of said cell. 

1 1 . The cell according to claim 1 wherein said promoter is AAV 
type 2 P5 promoter. 

12. The cell according to claim 1 wherein said parvovirus promoter 
is the P5 promoter from a strain of AAV selected from the group consisting of AAV 
type 1, AAV type 3, AAV type HI. 

13. The cell according to claim 1 wherein said second nucleic acid 
molecule is a plasmid. 

14. The cell according to claim 1 wherein said second nucleic acid 
molecule is a recombinant virus. 

15. The cell according to claim 1 wherein said second nucleic acid 
molecule is present in said cell as an episome. 

16. The cell according to claim 1 wherein said second nucleic acid 
molecule is integrated into the chromosome of said cell. 



17. The cell according to claim 1 wherein said helper functions are 
provided by a third nucleic acid molecule. 
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18. The cell according to claim 17 wherein said third nucleic acid 
molecule is a plasmid. 

19. The cell according to claim 17 wherein said third nucleic acid 
molecule is a recombinant or wild-type virus. 

20. The cell according to claim 17 wherein said third nucleic acid 
molecule is present in said cell as an episome. 

21. The cell according to claim 17 wherein said third nucleic acid 
molecule is integrated into the chromosome of said cell. 

22. The cell according to claim 1 which is derived from a HEK 293 

cell. 

23. A nucleic acid molecule comprising from 5' to 3': a parvovirus 
P5 promoter, a spacer, an AAV rep gene sequence and an AAV cap gene sequence, 
wherein said spacer is of sufficient size to reduce expression of the replS and rep6& 
gene products relative to other rep gene products. 

24. The molecule according to claim 23 wherein said spacer is 
between about 10 bp to 10 kb in length. 

25. The molecule according to claim 23 wherein the spacer 
sequence is between 100 bp to 3.8 kb in length. 

26. The molecule according to claim 23 wherein said spacer 
sequence is about 500 bp in length. 
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27. The molecule according to claim 23 wherein the spacer 
sequence is a random sequence of nucleotides. 

28. The molecule according to claim 23 wherein said spacer 
sequence encodes a gene product. 

29. The molecule according to claim 23 which is a plasmid. 

30. The molecule according to claim 23 which is a recombinant 

virus. 

31. The molecule according to claim 23 which is integrated into the 
chromosomes of a cell. 

32. The molecule according to claim 23 which is present episomally 

- in a cell. 

33. A method for producing recombinant adeno-associated virus 
(AAV), said method comprising the steps of 

(a) culturing a recombinant host cell containing 

(i) a first nucleic acid molecule comprising from 5 1 
to 3': a parvovirus P5 promoter, a spacer, an AAV rep gene sequence and an AAV 
cap gene sequence, wherein said spacer is of sufficient size to reduce expression of 
the repl% and rep6S gene products relative to other rep gene products, and 

(ii) a second nucleic acid molecule comprising a 
minigene comprising a transgene under the control of regulatory sequences directing 
expression thereof in a host cell and flanked by AAV inverse terminal repeats (ITRs); 

(iii) an optional third nucleic acid molecule which 
provides helper functions essential to the replication and packaging of rAAV, which 
functions are not provided by (i) or (ii) or said host cell; and 
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(b) isolating from said cell or cell culture, a recombinant 
AAV capable of expressing said transgene. 




X 
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SEQUENCE LISTING 

<110> Wilson, James M. 
Xiao, Weidong 

The Trustees of the University of Pennsylvania 



<120> Methods and Vector Constructs Useful for Production of 
Recombinant AAV 



<130> GNVPN. 027CIP1PCT 



<140> 
<141> 

<150> 60/059,330 
<151> 1997-09-19 



<160> 5 



<170> Patentln Ver. 2.0 



<210> 1 
<211> 8509 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Recombinant 
plasmid containing Adeno-Associated Virus 
sequences, CMV promoter & lacZ gene. 



<400> 1 

gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctgcgcgc 60 

tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc 120 

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc 180 

cttgtagtta atgattaacc cgccatgcta cttatctacg tagccattct ctagcccctg 240 

caggtcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa cgacccccgc 300 

ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac tttccattga 360 

cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca agtgtatcat 420 

atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg gcattatgcc 480 

cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt agtcatcgct 540 

attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg gtttgactca 600 

cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg gcaccaaaat 660 

caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat gggcggtagg 720 

cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca gatcgcctgg 780 

agacgccatc cacgctgttt tgacctccat agaagacacc gggaccgarc cagcctccgg 840 

actctagagg atccggtact cgaggaactg aaaaaccaga aagttaactg gtaagtttag 900 

tctttttgtc ttttatttca ggtcccggat ccggtggtgg tgcaaatcaa agaactgctc 960 
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ctcagtggat gttgccttta cttctaggcc tgtacggaag tgttacttct gctctaaaag 1020 
ctgcggaatt gtacccgcgg ccgcaattcc cggggatcga aagagcctgc taaagcaaaa 1080 
aagaagtcac catgtcgttt actttgacca acaagaacgt gattttcgtt gccggtctgg 1140 
gaggcattgg tctggacacc agcaaggagc tgctcaagcg cgatcccgtc gttttacaac 1200 
gtcgtgactg ggaaaaccct ggcgttaccc aacttaatcg ccttgcagca catccccctt 1260 
tcgccagctg gcgtaatagc gaagaggccc gcaccgatcg cccttcccaa cagttgcgca 1320 
gcctgaatgg cgaatggcgc tttgcctggt ttccggcacc agaagcggtg ccggaaagct 1380 
ggctggagtg cgatcttcct gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc 1440 
acggttacga tgcgcccatc tacaccaacg taacctatcc cattacggtc aatccgccgt 1500 
ttgttcccac ggagaatccg acgggttgtt actcgctcac atttaatgtt gatgaaagct 1560 
ggctacagga aggccagacg cgaattattt ttgatggcgt taactcggcg tttcatctgt 1620 
ggtgcaacgg gcgctgggtc ggttacggcc aggacagtcg tttgccgtct gaatttgacc 1680 
tgagcgcatt tttacgcgcc ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg 1740 
acggcaguta tctggaagat caggatatgt ggcggatgag cggcattttc cgtgacgtct 1800 
cgttgctgca taaaccgact acacaaatca gcgatttcca tgttgccacc cgctttaatg 1860 
atgatttcag ccgcgctgta ctggaggctg aagttcagat gtgcggcgag ttgcgtgact 1920 
acctacgggt aacagtttct ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc 1980 
ctttcggcgg tgaaattatc gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc 2040 
tgaacgtcga aaacccgaaa ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg 2100 
ttgaactgca caccgccgac ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc 2160 
gcgaggtgcg gatcgaaaat ggtctgctgc tgctgaacgg caagccgttg ctgattcgag 2220 
gcgttaaccg tcacgagcat catcctctgc atggtcaggt catggatgag cagacgatgg 2280 
tgcaggatat cctgctgatg aagcagaaca actttaacgc cgtgcgctgt tcgcattatc 2340 
cgaaccatcc gctgtggtac acgctgtgcg accgctacgg cctgtatgtg gtggatgaag 2400 
ccaatattga aacccacggc atggtgccaa tgaatcgtct gaccgatgar ccgcgctggc 2460 
taccggcgat gagcgaacgc gtaacgcgaa tggtgcagcg cgatcgtaac cacccgagtg 2520 
tgatcatctg gtcgctgggg aatgaatcag gccacggcgc taatcacgac gcgctgtatc 2580 
gctggatcaa atccgtcgat ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca 2640 
ccacggccac cgatattatt tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc 2700 
cggctgcgcc gaaatggtcc atcaaaaaat ggctttcgct acctggagag acgcgcccgc 2760 
tgatcctttg cgaauacgcc cacgcgatgg gtaacagtct tggcggtttc gctaaatact 2820 
ggcaggcgtt tcgtcagtat ccccgtttac agggcggctt cgtctgggac tgggtggatc 2880 
agtcgctgat taaacatgat gaaaacggca acccgtggtc ggcttacggc ggtgattttg 2940 
gcgatacgcc gaacgaccgc cagttctgta tgaacggtct ggtctttgcc gaccgcacgc 3000 
cgcatccagc gccgacggaa gcaaaacacc agcagcagtt tttccagttc cgtttatccg 3060 
ggcaaaccat cgaagtgacc agcgaatacc tgttccgtca tagcgataac gagctcctgc 3120 
actggatggt ggcgctggat ggtaagccgc tggcaagcgg tgaagtgccn ctggatgtcg 3180 
ctccacaagg taaacagttg attgaactgc ctgaactacc gcagccggag agcgccgggc 3240 
aactctggct cacagtacgc gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc 3300 
acatcagcgc ctggcagcag tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg 3360 
cgtcccacgc catcccgcat ctgaccacca gcgaaatgga tttttgcatc gagctgggta 3420 
ataagcgttg gcaatttaac cgccagtcag gctttctttc acagatgtgg attggcgata 3480 
aaaaacaact gccgacgccg ctgcgcgatc agttcacccg tgcaccgctg gataacgaca 3540 
ttggcgtaag tgaagcgacc cgcattgacc ctaacgcctg ggtcgaacgc cggaaggcgg 3600 
cgggccazra ccaggccgaa gcagcgttgt tgcagtgcac ggcagataca cttgctgatg 3660 
cggtgccgat tacgaccgct cacgcgtggc agcatcaggg gaaaacctta tttatcagcc 3720 
ggaaaaccca ccggattgat ggtagtggtc aaacggcgat taccgttgai: gttgaagtgg 3780 
cgagcgacac accgcatccg gcgcggattg gcctgaactg ccagctggcg caggtagcag 3840 



2 



WO 99/14354 



PCT/US98/19479 



agcgggtaaa ctggctcgga ttagggccgc 
cctgttttga ccgctgggat ctgccattgt 
gcgaaaacgg tctgcgctgc gggacgcgcg 
gcgacttcca gttcaacatc agccgctaca 
gccatctgct gcacgcggaa gaaggcacat 
ttggtggcga cgactcctgg agcccgtcag 
gctaccatta ccagttggtc tggtgtcaaa 
ccgtatttcg cgtaaggaaa tccattatgt 
tcggtttatt ctttttcttt tactttttta 
tt'tggctaca tgacatcaac catatcagca 
tttctctgtt ctcgctatta ttccaaccgc 
tcgactccag gcggccgcgg ggatccagac 
accacaacta gaatgcagtg aaaaaaatgc 
ttatttgtaa ccattataag ctgcaacaaa 
atgtttcagg ttcaggggga ggtgtgggag 
aggggctaga atggctacgt agataagtag 
cccctagtga tggagttggc cactccctct 
cgaccaaagg tcgcccgacg cccgggcttt 
cgcagctggc gtaatagcga agaggcccgc 
ctgaatggcg aatggaattc cagacgattg 
tttttcctgt tgcaatggct ggcggtaata 
gtttgagttc ttctactcag gcaagtgatg 
cggttaattt gcgtgatgga cagactcttt 
cttctcagga ttctggcgta ccgttcctgt 
gctcccgctc tgactctaac gaggaaagca 
tacgcgccct gtagcggcgc attaagcgcg 
gctacacttg ccagcgccct agcgcccgct 
acgttcgccg gctttccccg tcaagctcta 
agtgctttac ggcacctcga ccccaaaaaa 
ccatcgccct gatagacggt ttttcgccct 
ggactcttgt tccaaactgg aacaacactc 
taagggattt tgccgatttc ggcctattgg 
aacgcgaatt ttaacaaaat attaacgttt 
ctgttttcgg ggcttttctg attatcaacc 
cgattaccgt tcaLcgattc tcttgtttgc 
tttgtagaga ccictcaaaa atagctaccc 
ttgaatatca tattgatggt gatttgactg 
tacctacaca ttactcaggc attgcattta 
cttgcgttga aataaaggct tctcccgcaa 
caaccgattt agctttatgc tctgaggctt 
gcctgtatga tttattggat gttggaattc 
tgcggtattt cacaccgcat atggtgcact 
ttaagccagc cccgacaccc gccaacaccc 
ccggcatccg ctnacagaca agctgtgacc 
tcaccgtcat caccgaaacg cgcgagacga 
gttaatgtca tgacaataat ggtttcttag 
cgcggaaccc ctatttgttt atttttctaa 
caataaccct gacaaatgct tcaataatat 



aagaaaacta tcccgaccgc cttactgccg 3900 
cagacatgta taccccgtac gtcttcccga 3S60 
aattgaatta tggcccacac cagtggcgcg 4020 
gtcaacagca actgatggaa accagccatc 4080 
ggctgaatat cgacggtttc catatgggga 4140 
tatcggcgga attacagctg agcgccggtc 4200 
aataataata accgggcagg ccatgtctgc 4260 
actatttaaa aaacacaaac ttttggatgt 4320 
tcatgggagc ctacttcccg tttttcccga 4380 
aaagtgatac gggtattatt tttgccgcta 4440 
tgtttggtct gctttctgac aaactcggcc 4500 
atgataagat acattgatga gtttggacaa 4560 
tttatttgtg aaatttgtga tgctattgct 4620 
caagttaaca acaacaattg cattcatttt 4680 
gttttttcgg atcctctaga gtcgacctgc 4740 
catggcgggt taatcattaa ctacaaggaa 4800 
ctgcgcgctc gctcgctcac tgaggccggg 4860 
gcccgggcgg cctcagtgag cgagcgagcg 4920 
accgatcgcc cttcccaaca gttgcgcagc 4980 
agcgtcaaaa tgtaggtatt tccatgagcg 5040 
ttgttctgga tattaccagc aaggccgata 5100 
ttattactaa tcaaagaagt attgcgacaa 5160 
tactcggtgg cctcaccgat tataaaaaca 5220 
ctaaaatccc tttaatcggc ctcctgttta 5280 
cgttatacgt gctcgtcaaa gcaaccatag 5340 
gcgggtgtgg tggttacgcg cagcgtgacc 5400 
cctttcgctt. tcttcccttc ctttctcgcc 5460 
aatcgggggc tccctttagg gttccgattt 5520 
cttgattagg gtgatggtcc acgtagtggg 5580 
ttgacgttgg agtccacgtr ctttaatagt 5640 
aaccctatct cggtctattc ttttgattta 5700 
ttaaaaaatg agctgattta acaaaaattt 5760 
acaatttaaa tatttgctca tacaatctic 5820 
ggggtacata tgattgacac gctagtttta 5880 
uccagactct caggcaatga cctgacagcc 5940 
tctccggcat gaatttatca gctagaacgg 6000 
tctccggcct ttctcacccg tttgaatctt 6060 
aaatatatga gggttccaaa aatttttatc 6120 
aagtattaca gggtcataat gtttttggta 6180 
tattgcttaa ttttgctaat tctttgcctt 6240 
ctgatgcggt attttctcct tacgcatctg 6300 
ctcagtacaa tctgctctga tgccgcatag 6360 
gctgacgcgc cctgacgggc ttgtctgctc 6420 
gtctccggga gctgcatgtg tcagaggttt 6480 
aagggcctcg tgatacgcct atttttatag 6540 
acgtcaggtg gcacttctcg gggaaargcg 6600 
atacattcaa atatgtaucc gctcatgaga 6660 
rgaaaaagga agagtatgag tattcaacat 6720 
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ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 6780 
gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 6840 
gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 6900 
atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 6S60 
caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 7020 
gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 7080 
accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 7140 
ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 7200 
gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 7260 
acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 7320 
atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 7380 
ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 7440 
gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 7500 
gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat 7560 
tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 7620 
taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 7680 
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 7740 
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 7800 
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 7860 
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 7920 
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 7980 
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 8040 
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 8100 
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 8160 
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 8220 
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 8280 
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 8340 
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 8400 
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 8460 
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagc 8509 

<210> 2 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<210> 3 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 



<400> 2 



tgtagttaat gattaacccg ccatgctact tatc 
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<223> Description of Artificial Sequence: PCR primer 



<400> 3 



ggcggctgcg cgttcaaacc tcccgcttca aaatg 
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<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 4 

tatttaagcc cgagtgagct 20 

<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<400> 5 



tagcatggcg ggttaatcat taactaca 
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